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(54) Abstract Title 

Conversion of video to a streaming slide show 



(57) A method of converting motion based video to a 
streaming slide show comprising the steps of receiving 
the video as a series of images, selecting a number of 
images based on the desired bandwidth for transmission 
and generating a streaming slide show using the selected 
images. The video may further comprise audio which 
may be synchronised with the selected frames and the 
number of images selected may be determined according 
to a bit rate budget. Preferably the step of selecting 
images includes the steps of selecting candidate frames, 
ranking the candidate frames and then selecting from the 
candidate frames. 
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CONVERSION OP VIDEO AND AOTIO TO A STREAMING SLIDE SHOW 

This invention relates in general to video and audio transmission 
systems performed by computers, and in particular, to conversion of video 
and audio to a streaming slide show. 

For nearly half a century computers have been used by businesses to 
manage information such as numbers and text, mainly in the form of coded 
data. However, business data represents only a small part of the world's 
information. As storage, communication and information processing 
technologies advance, and as their costs come down, it becomes more 
feasible to digitize other types of data, store large volumes of it, and be 
able to distribute it on demand to users at their place of business or home 
via a network. 

New digitization technologies have emerged in the last decade to 
digitize images, audio, and video, giving birth to a new type of digital 
multimedia information. These multimedia objects are quite different from 
the business data that computers managed in the past, and often require 
more advanced information management system infrastructures with new 
capabilities. 

Multimedia data is typically not fully pre-structured (i.e., its use 
is not fully predictable) because it is the result of the creation of a 
human being or the digitization of an object of the real world (e.g., 
movies) . The digitization of multimedia information (image, audio, video) 
produces a large set of bits called an -object" or -large object- (LOB) or 
-binary large object" (BLOB). For example, a digitization of a movie, even 
after compression, may take as much as the equivalent of several billions 
of characters (3-4 GB) to be stored. 

As more users are networked together, there is an increase in the 
storage of multimedia data, such as video and audio data, with transmission 
of the multimedia data to users via the network. However, full 
motion-based and/or full resolution videos are by nature large and, 
therefore, demand high bit rates for transmission over networks or modems. 
A motion-based video is a series of frames (i.e.. a sequence of single 
still images) that are displayed in a manner that results in an optical 
illusion of motion, as perceived by a viewer. The bit rate or bandwidth 
refers to an amount of data that can be transmitted in a given period over 



a transmission channel (e.g., a network) and is typically represented as 
bits per second (bps) . 

The size of a video may result in very long download delays, greatly 
reduced resolution and quality, and, typically, very small image sizes, 
which render the original content difficult to view. 

Thus, there is a need in the art for an improved technique for 
transmitting video data. 

To seek to overcome the limitations in the prior art described above, 
and to seek to overcome other limitations that will become apparent upon 
reading and understanding the present specification, the present invention 
discloses a method, apparatus, and article of manufacture for conversion of 
video and audio to a streaming slide show. 

According to a first aspect of the present invention, a method of 
processing a video stored on a data store connected to a computer, the 
method comprises the steps of: 

receiving a motion-based video comprised of a series of images; 

selecting one or more images from the motion-based video based on a 
desired bandwidth for transmission; and 

generating a streaming slide show using the selected images. 

According to a second aspect of the present invention, an apparatus 
for processing a video, comprises: 

a computer having a data store coupled thereto, wherein the data 
store stores the video; and 

one or more computer programs, performed by the computer, for 
receiving a motion-based video comprised of a series of images, selecting 
one or more images from the motion-based video based on a desired bandwidth 
for transmission, and generating a streaming slide show using the selected 
images . 

According to a third aspect of the present invention, an article of 
manufacture comprising a program storage medium readable by a computer and 



embodying one or more instructions executable by the computer to perform 
method steps for processing a video stored on a data store connected to the 
computer, the method comprising the steps of: 

receiving a motion-based video comprised of a series of images; 

selecting one or more still images from the motion-based video based 
on a desired bandwidth for transmission; and 

generating a streaming slide show using the selected images. 

According to an embodiment of the invention, a video stored on a data 
store connected to a computer is processed. Initially, a motion-based 
video comprised of a series of images is received. One or more images are 
selected from the motion-based video based on a desired bandwidth for 
transmission. Then, a streaming slide show is generated using the selected 
images. 

For a better understanding of the present invention reference will 
now be made, by way of example, to the accompanying drawings in which like 
reference numbers represent corresponding parts throughout and in which: 

FIG. 1 is a hardware environment used to implement an embodiment of 
the invention; and 

FIG. 2 is a flow diagram illustrating the steps performed by the 
conversion system. 

In the following description of an embodiment of the invention, 
reference is made to the accompanying drawings which form a part hereof, 
and in which is shown by way of illustration a specific embodiment in which 
the invention may be practised. It is to be understood that other 
embodiments may be utilized and structural and functional changes may be 
made without departing from the scope of the present invention. 

FIG. 1 schematically illustrates the hardware environment of an 
embodiment of the present invention, and more particularly, illustrates a 
typical distributed computer system using a network 100 to connect client 
computers 102 executing client applications to a server computer 104 
executing software and other computer programs, and to connect the server 
system 104 to data sources 106 and video sources 112. A data source 106 



may comprise, for example, a multi -media database containing video. A 
video source 112 may comprise, for example, a live video stream or images 
from a camera. 

A typical combination of resources may include client computers 102 
that are personal computers or workstations, and a server computer 104 
that is a personal computer, workstation, minicomputer, or mainframe. 
These systems are coupled to one another by various networks, including 
LANs, wans, SNA networks, and the Internet. Each client computer 102 and 
the server computer 104 additionally comprise an operating system and one 
or more computer programs. 

A client computer 102 typically executes a client application and is 
coupled to a server computer 104 executing one or more server software. 
The client application may be a computer program such as a video player. 
The server software may include a conversion system 110, which is a 
computer program for converting video to a streaming slide show. The 
server computer 104 also uses a data source interface and, possibly, other 
computer programs, for connecting to the data sources 106. The client 
computer 102 is bi-directionally coupled with the server computer 104 over 
a line or via a wireless system. In turn, the server computer 104 is 
bi-directionally coupled with data sources 106. 

The operating system and computer programs are comprised of 
instructions which, when read and executed by the client and server 
computers 102 and 140, cause the client and server computers 102 and 140 to 
perform the steps necessary to implement and/or use the present invention. 
Generally, the operating system and computer programs are tangibly embodied 
in and/or readable from a device, carrier, or media, such as memory, other 
data storage devices, and/or data communications devices. Under control of 
the operating system, the computer programs may be loaded from memory, 
other data storage devices and/or data communications devices into the 
memory of the computer for use during actual operations. 

Thus, the present invention may be implemented as a method, 
apparatus, or article of manufacture using standard programming and/or 
engineering techniques to produce software, firmware, hardware, or any 
combination thereof. The term "article of manufacture 1 ' (or alternatively, 
"computer program product") as used herein is intended to encompass a 
computer program accessible from any computer- readable device, carrier, or 
media. Of course, those skilled in the art will recognize many 
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modifications may be made to this configuration without departing from the 
scope of the present invention. 

Those skilled in the art will recognize that the exemplary 
5 environment illustrated in FIG. 1 is not intended to limit the present 

invention. Indeed, those skilled in the art will recognize that other 
alternative hardware environments may be used without departing from the 
scope of the present invention. 

10 An embodiment of the invention provides a conversion system 110 . The 

conversion system 110 receives a motion-based video (e.g., a movie). The 
motion-based video may have an audio component (referred to as an audio 
stream) , as well as a video component (referred to as a video stream) . The 
conversion system 110 converts the motion-based video into a series of 

15 slides (i.e., a streaming slide show). If the motion-based video has an 

audio component, the conversion system 110 incorporates the audio into the 
streaming slide show. The streaming slide show includes all of the audio 
component and selected portions of the video component. By creating a 
streaming slide show, the conversion system 110 reduces the size of the 

20 data to be transmitted. Then, the conversion system 110 transmits the 

streaming slide show, instead of the video. This avoids the problems 
associated with transmitting video, such as download delays or poor 
resolution of the video. 

25 The conversion system 110 allows full resolution images to be 

displayed with synchronized audio, but as a "slide show" of individual 
images rather than as a motion-based video. A motion-based video is a 
series of frames (i.e., a sequence of single still images) that are 
displayed in a manner that results in an optical illusion of motion, as 

30 perceived by a viewer. 

On the other hand, some conventional systems allow for selection of 
images, and these are displayed as "thumbnails", which are tiny, 
compressed images. For example, some conventional systems select frames 

3 5 from a video, using techniques, such as detecting scene changes. Then, 

these conventional systems create a "storyboard" or display with small 
sized images of the selected frames. The result is typically a set of low 
resolution, poor quality images that are difficult for a viewer to look 
at. Additionally, these thumbnails are built around scene changes, without 

40 regard to maintaining a desired bit rate. 
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To avoid the problems of conventional systems, the conversion system 
110 creates a streaming slide show by extracting key content video frames 
from the motion-based video. In particular, the conversion system 110 
receives a motion-based video. Then, the conversion system 110 analyzes 
5 the motion-based video to locate and mark key frames. Initially, the 

conversion system selects a set of candidate key frames. The selection of 
a set of candidate key frames is based on technologies well known in the 
art, such as scene change detection, camera pan, elapsed time, etc. The 
selected candidate key frames are converted into standard compressed image 
10 files (e.g., Joint Photographic Experts Group (JPEG)), resulting in 

candidate still images . 

From the candidate still images, the conversion system 110 further 
selects slide show images that are to be combined with the audio. The 

15 conversion system 110 selects slide show images based on either a constant 

or variable bit rate based, for example, on user input. These slide show 
images are linked together and combined with audio to meet a specified bit 
rate and quality target (e.g., desired resolution or specific streaming 
rate for a target modem) . The conversion system 110 is advantageous in 

2 0 that it selects images in a manner that provides a proper sequence of 

images that represent the important content, yet still maintains a smooth 
image flow, without exceeding the delivery bandwidth capacity. 

Then, the conversion system 110 combines the selected slide show 
25 images with the audio component into a data stream. As an additional 

enhancement, the conversion system 110 can compress the audio component 
using well known technologies (e.g., subsambling, white space compression, 
etc.) to further reduce the data rate requirements while still maintaining 
the critical audio content. 

30 

The conversion system 110 outputs full resolution "slides" 
synchronized with the audio. This streaming slide show is most 
advantageous for low bit rate delivery mechanisms (e.g., modems) as well as 
networks. Having high quality audio with full resolution and high quality 

35 images, even on very low bit rate networks or connections, allows a user to 

hear all of the important audio information, while viewing the full 
resolution images. In most cases, the critical information is in the audio 
or captured in the key images, and not contained in the motion. Therefore, 
maintaining high quality of the key images, along with full audio, for the 

40 available bandwidth, allows a much better viewing experience. 
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FIG. 2 is a flow diagram illustrating the steps performed by the 
conversion system 110. Initially, in block 200, the conversion system 110 
receives a full motion-based video comprised of synchronized audio and 
video components. For example, this motion-based video may be in a Moving 
Pictures Expert Group (MPEG) format. In block 202, the conversion system 
no processes the motion-based video to select candidate key frames (full 
resolution or stand-alone, independent video frames) . In particular, the 
conversion system 110 analyzes the motion-based video to determine which 
frames represent important events (e.g., scene changes, camera pans or 
zooms, context changes, and other video events) . This process is completed 
using technology well known to those familiar with the art. 



Next, in block 204, the conversion system 110 generates candidate 
still images from the candidate key frames in standard formats (e.g., Joint 
15 Photographic Experts Group (JPEG) , Graphics Interchange Format (GIF) , or 

bit mapped graphics (BMP)). In block 206, the conversion system 110 stores 
these candidate still images with time base references. The time base 
references will be used by the conversion system 110 for synchronization 
with audio. 

20 

Then, the conversion system 110 processes the audio component in 
block 208. Optionally, the conversion system 110 compresses the audio 
component to reduce data rate, while still maintaining the time based 
synchronization information. Additionally, the conversion system 110 may 
25 remove white space from the audio. 

In block 210, the conversion system 110 selects slide show images 
from the candidate still images based on bit rate, similarity of content 
with the previous image, the relative importance of the image compared to 
30 other candidate images based on similarities or differences, and the 

overall timing of frames necessary to achieve a smooth flow. Although the 
characteristics for selecting slide show images from candidate still images 
may be discussed separately, it is to be understood that the selection may 
be based on any one characteristic or some combination of characteristics. 

35 

To select slide show images based on a desired bit rate, the 
conversion system 110 performs a bit rate assessment. For the assessment, 
the conversion system 110 deducts the bandwidth required for the audio 
component from the total bandwidth available to determine the bandwidth 
40 required for the streaming slide show component (i.e., an image bit rate 

budget) . Then, using the bandwidth required for the streaming slide show 



component and knowing the compression of each still image, the conversion 
system 110 determines the total number of slide show images that can be 
transmitted to maintain a desired bit rate. The total number of slide show 
images to be transmitted is calculated by multiplying the time required for 
transmitting the audio component with the image bit rate budget and 
dividing by an image size (i.e., the size of one of the slide show images). 

Continuing with the discussion of how the conversion system 110 
selects slide show images based on obtaining a desired bit rate, the 
desired bit rate may obtained in several ways. For example, the desired 
bit rate may be user-specified or may be based on a quality target (e.g., a 
specified amount of time to download) . For example, if the conversion 
system 110 can transmit one slide show image every 10 seconds (i.e., to 
obtain a bit rate equal to the number of bits of an image divided by 10 
seconds) , the conversion system 110 may select one candidate still image at 
every 10 second mark using the time base references associated with the 
images. In particular, there may be several candidate still images at or 
near a 10 second mark, and the conversion system 110 selects one of these. 
Selection may be based on various factors, for example, the middle' 
candidate still image may be selected from a range around the 10 second 
mark. If no candidate still image is available at a 10 second mark, then 
the conversion system 110 selects, for example, a candidate still image 
whose time base reference is closest to and less than the 10 second mark or 
it may repeat the previous image. 

To select slide show images based on similarity of content with the 
previous image, the conversion system 110 may use a tool to select 
candidate key frames that provides ranking of the frames. In particular, a 
rank ordering is provided along with the candidate key frames over, for 
example, a period of time. That is, over a one second interval, the 
candidate key frames selected in that interval are ranked. 

To select slide show images based on other characteristics, the 
conversion system 110 may, for example, select candidate key frames so 
that if a single or very similar image is repeated over a relatively long 
time period, that image would be repeated only often enough to meet the 
minimum bandwidth constraints. If a series of rapidly changing images 
occur over a brief time period, only the most predominant images would be 
selected and included to stay below the maximum bandwidth constraints. 
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In block 212, once the conversion system 110 has a collection of 
slide show images and an audio component, both with sufficient timing 
information relative to each other to allow them to be combined in a 
synchronized manner, the conversion system 110 combines the selected slide 
5 show images with the audio component. 

This combination process is one of the advantages of this invention 
in that the conversion system 110 combines the audio component with the 
selected slide show images for a particular bit rate. The invention is 
10 further advantageous in that the audio component is linked with slide show 

images in such a manner (multiplexed or otherwise combined in a format that 
allows synchronized playback using standard players from a standard bit 
stream) that playback of the images flows smoothly and presents most or 
all of the critical information that was contained in the video. 

15 

In block 214, the conversion system 110 transmits the streaming slide 
show, in particular, for transmission, the conversion system 110 may break 
up each slide show image into portions, interleave each slide show image 
portion with audio, and transmit this combination to a user at a client 

20 computer. At the client computer, the conversion system 110 reforms a 

slide show image from the portions for that slide show image. Then, the 
slide show image is displayed and its associated audio is played. In an 
alternate embodiment, the conversion system 110 may transmit the audio for 
a slide show image and all of the portions for that slide show image 

25 separately and then combine the audio and slide show image portions as the 

audio and slide show image portions are received (i.e., "on the fly"). 

Thus, the conversion system 110 automates the steps used for 
conversion, based on user definable parameters that determine the target 
data rate, the level of compression, image size, the priority of specific 

30 key frame types, etc. The benefits of a fully automated system include the 

automatic generation of high quality still image slides from a high bit 
rate video for low bit rate access techniques (e.g. , transmission of data 
over a network), while still maintaining the full or maximum screen quality 
and resolution. 

35 

There are, of course many alternative embodiments for accomplishing 
the present invention. For example, any type of computer, such as a 
mainframe, minicomputer, or personal computer, or computer configuration, 
such as a timesharing mainframe, local area network, or standalone personal 
40 computer, could be used in the present invention. 
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The foregoing description of an embodiment of the invention has been 
presented for the purposes of illustration and description. It is not 
intended to be exhaustive nor to limit the invention to the precise form 
disclosed. Many modifications and variations are possible in light of the 
above teaching. It is intended that the scope of the invention be limited 
not by this detailed description, but rather by the claims appended hereto. 
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CLAIMS 

1. A method of processing a video stored on a data store connected to a 
computer, the method comprising the steps of: 
5 receiving a motion-based video comprised of a series of images; 

selecting one or more images from the motion-based video based on a 
desired bandwidth for transmission; and 

generating a streaming slide show using the selected images. 

10 2. A method according to claim 1, wherein selecting one or more images 

further comprises determining a particular number of images to select. 

3. A method according to claim 2, wherein the motion-based video 
further comprises audio and wherein the number of images to select is based 

15 on multiplying a time required for transmitting the audio with an image bit 

rate budget and dividing by an image size. 

4. A method according to any preceding claim, further comprising, prior 
to selecting one or more images, selecting one or more candidate frames. 

20 

5. A method according to claim 4, further comprising ranking the 
selected candidate frames. 

6. A method according to claim 4 or 5, further comprising generating 
25 candidate images from the candidate frames. 

7. A method according to claim 6, wherein selecting the one or more 
images from the motion- based video comprises selecting from among the 
candidate images. 

30 

8. A method according to claim 1 or 2, wherein the received motion-based 
video further comprises audio. 

9. A method according to claim 8, wherein the audio is synchronized with 
35 the selected images. 

10. A method according to claim 9, wherein the synchronization is 
performed using time base references associated with the selected images 
and with the audio. 
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11. A method according to any preceding claim, wherein the desired 
bandwidth is obtained from user input. 

12. A method according to any of claims X to 10, wherein the desired 
5 bandwidth comprises a constant bit rate. 

13. A method according to any of claims 1 to 10, wherein the desired 
bandwidth comprises a variable bit rate. 

10 14. Apparatus for processing a video, comprising: 

a computer having a data store coupled thereto, wherein the data 
store stores the video; and 

one or more computer programs, performed by the computer, for 
receiving a motion-based video comprised of a series of images, selecting 
15 one or more images from the motion-based video based on a desired bandwidth 

for transmission, and generating a streaming slide show using the selected 
images . 

15. Apparatus according to claim 14, wherein selecting one or more images 
20 further comprises determining a particular number of images to select. 

16. Apparatus according to claim 15, wherein the motion-based video 
further comprises audio and wherein the number of images to select is based 
on multiplying a time required for transmitting the audio with an image bit 

25 rate budget and dividing by an image size. 

17. Apparatus according to any of claims 14 to 16, further comprising, 
prior to selecting one or more images, selecting one or more candidate 
frames . 

30 

18. Apparatus according to claim 17, further comprising ranking the 
selected candidate frames. 

19. Apparatus according to claim 17 or 18, further comprising generating 
35 candidate images from the candidate frames. 

20. Apparatus according to claim 19, wherein selecting the one or more 
images from the motion-based video comprises selecting from among the 
candidate images. 
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21. Apparatus according to claim 14 or 15, wherein the received 
motion-based video further comprises audio. 

22. Apparatus according to claim 21. wherein the audio is synchronized 
with the selected images. 

23. Apparatus according to claim 22, wherein the synchronization is 
performed using time base references associated with the selected images 
and with the audio. 

24. Apparatus according to any of claims 14 to 23, wherein the desired 
bandwidth is obtained from user input. 

25. Apparatus according to any of claims 14 to 23, wherein the desired 
bandwidth comprises a constant bit rate . 

26. Apparatus according to any of claims 14 to 23, wherein the desired 
bandwidth comprises a variable bit rate. 

27. An article of manufacture comprising a program storage medium 
readable by a computer and embodying one or more instructions executable by 
the computer to perform method steps for processing a video stored on a 
data store connected to the computer, the method comprising the steps of: 

receiving a motion-based video comprised of a series of images; 
selecting one or more still images from the motion-based video based 
on a desired bandwidth for transmission; and 

generating a streaming slide show using the selected images. 

28. An article of manufacture according to claim 27, wherein selecting 
one or more images further comprises determining a particular number of 
images to select. 

29. An article of manufacture according to claim 28, wherein the 
motion-based video further comprises audio and wherein the number of images 
to select is based on multiplying a time required for transmitting the 
audio with an image bit rate budget and dividing by an image size . 

30. An article of manufacture according to claim 29, further comprising, 
prior to selecting one or more images, selecting one or more candidate 
frames . 
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31. An article of manufacture according to claim 30, further comprising 
ranking the selected candidate frames. 

32. An article of manufacture according to claim 30 or 31, further 
comprising generating candidate images from the candidate frames. 

33. An article of manufacture according to claim 32, wherein selecting 
the one or more images from the motion-based video comprises selecting from 
among the candidate images. 

34. An article of manufacture according to claim 27, wherein the received 
motion-based video further comprises audio. 

35. An article of manufacture according to claim 34, wherein the audio is 
synchronized with the selected images. 

36. An article of manufacture according to claim 35, wherein the 
synchronization is performed using time base references associated with 
the selected images and with the audio. 

37. An article of manufacture according to any of claims 27 to 36, 
wherein the desired bandwidth is obtained from user input. 

38. An article of manufacture according to any of claims 27 to 36, 
wherein the desired bandwidth comprises a constant bit rate. 

39. An article of manufacture according to any of claims 27 to 36, 
wherein the desired bandwidth comprises a variable bit rate. 

40. A method, apparatus or article of manufacture, substantially as 
hereinbefore described with reference to the accompanying drawings 
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DETAILED ACTION 
Response to Arguments 
Applicant's arguments filed 12/22/2008 have been fully considered but they are 
not persuasive. 

In re page 13 line 15 -21 , Applicants argue with that Kato does not recite 
playitem includes duration information indicating whether to display the at least one still 
picture for one a finite time period. 

In response the examiner respectfully disagrees. Kato discloses a playitem (Figs. 
2 and 32) includes duration information indicating whether to display the at least one 
st'il picture (Video stream clip of a playitem)for one a finite time period (injime and 
outjime are finite amount of time that are used to playback a playitem 0280-0281 ). 

Claim Rejections • 35 USC §112 
1. Claim 1, 26 , 27, 28 and 29 rejected under 35 U.S.C. 112, first paragraph, as 
failing to comply with the written description requirement. The claim(s) contains subject 
matter which was not described in the specification in such a way as to reasonably 
convey to one skilled in the relevant art that the inventors), at the time the application 
was filed, had possession of the claimed invention. Specification does not mention and 
"an infinite period of time until user input is received". Specification only mentions 
reproduction of still images having infinite duration in an order set forth by the play list 
(0045). 
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C/a/m Rejections - 35 X/SC 5 103 

2. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 

obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the Invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

3. Claimsl -4, 6-1 1 , 26-31 , 33-35,37-38,40-42,44-45,47-49,51-52 and 54-56 are 
rejected under 35 U.S.C. 103(a) as being anticipated by U.S. Patent Pub. 
2002/0164152 A1 to Kato et al. ("Kato") in view of U.S. Patent Pub. 2002/0130896 
A1 to Spence et al. "Spence" 

As to claims 1, 26 and 27 Kato discloses a recording medium having a data 
structure for managing reproduction of still pictures, comprising: a playlist area storing at 
least one playlist file (0154; 01 72; playlist is set of playback domains), the playlist file 
including at least one playitem (Figs. 7, 39) (0154; 0172), at least one sub-playitem (sub 
playitem, Fig. 7) and mark information (0160) (0188-0190), 

the playitem providing navigation information (EP_Map; Fig 67; 0347-0350) for 
reproducing presentation data (Video data and ancillary data, 0170) from a first stream 
file (AV stream file), the presentation data including the at least one still picture (video 
data includes picture information; Fig. 39) and associated data (Ancillary data or dip 
information file;0170) and not including audio data (Clip can be video or audio, Fig. 71; 
0352-0353), the presentation data (video data) being divided into still picture units (play 
items) in the first stream file such that each still picture unit (Fig. 83) includes a still 
picture and associated data (video and ancillary data or clip information file; 01 70; Figs. 
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2 and 67), the playitem including duration information (in_time and outjime, Fig. 32; 
0280-0281) indicating whether to display the at least one still picture for one of a finite 
period (in and out time is finite time) (0280-0281 ). 

the sub-playitem (Fig.7) associated with the playitem (Fig. 7)and providing 
navigation information (EP_Map; Fig. 67;0347-0350) for reproducing audio data from a 
second stream file (audio stream; 0349-0350), and the mark information including at 
least one mark pointing to the still picture (160;01 88-01 92)(0298)(Fig. 83). 

Kato does not expressly disclose displaying of atleast one still picture an infinite 
period of time until user input is received. 

Spence discloses displaying of atleast one still picture an infinite period of time 
until user input is received (001 1 , 0096). 

At the time of invention, it would have been obvious to a person of ordinary skill 
in the art to combine Kato with the teachings of Spence. Motivation to combine would 
have been so that images are displayed until a user has provided an input. So that a 
user has a greater control of a slide show. 

As to claims 2, 30, 37, 44. 61 , Kato further discloses wherein the at least one 
mark includes a type indicator indicating that the mark is of a type used for pointing to a 
still picture (0193-0194). 

As to claim 3, Kato further discloses wherein the at least one mark includes a 
time stamp indicating a time address of the still picture in the first stream file (0189). 

As to claim 4, Kato further discloses wherein the at least one mark includes a 
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playitem indicator indicating the playitem with which the at least one mark is associated 
(01 89) (0190) (Fig. 44) (0294). 

As to claims 6, Kato further discloses wherein the mark includes a time stamp 
indicating a time address of the still picture in the first stream file (0189) (0299). 

As to claims 7, Kato further discloses wherein the mark includes a playitem 
indicator indicating the playitem with which the mark is associated (0189) (0190) (Fig. 
44) (0294). 

As to claim 8, Kato further discloses wherein the at least one mark includes a 
time stamp indicating a time address of the still picture in the first stream file (0189) 
(0299). 

As to claim 9, Kato further discloses wherein the at least one mark includes a 
playitem indicator indicating the playitem with which the atleast one mark is associated 

(0189) (0190) (Fig. 44) (0294). 

As to claim 10, Kato further discloses wherein the mark information includes a 
number of marks, and the mark information includes a number indicator indicating the 
number of marks (0298). 

As to claim 11, Kato further discloses wherein, for each mark, the mark 
information provides a type indicator indicating a type of the at least one mark (0189) 

(01 90) (Fig. 44) (0294) (0298) (Fig. 43). 

As to claim 28, Kato discloses an apparatus for recording a data structure for 
managing reproduction of at least one still image on a recording medium, the apparatus 
comprising; 
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a pick up configured to record data on the recording medium (Fig. 1, Readout unit 26); 
a controller configured to control the pick up (Fig. 1 , 26) to record at least one playlist 
file on the recording medium (Fig. 1 , controller 23), 

the playlist file including at least one playitem (Fig. 7)(01 54), at least one sub-playitem 
(Fig. 7) and mark information (01 60X01 88-01 90). 

the playitem providing navigation information (EP_Map; Fig 67; 0347-0350) for 
reproducing presentation data (Video data and ancillary data, 0170) from a first stream 
file (AV stream file), the presentation data including the at least one still picture (video 
data includes picture information; Fig. 39) and associated data (Ancillary data or clip 
information file;0170) and not including audio data (Clip can be video or audio, Fig. 71; 
0352-0353), the presentation data (video data) being divided into still picture units (play 
items) in the first stream file such that each still picture unit (Fig. 83) includes a still 
picture and associated data (video and ancillary data or clip information file; 0170; Figs. 
2 and 67), the playitem including duration information (injime and outjime, Fig. 32; 
0280-0281) indicating whether to display the at least one still picture for one of a finite 
period (in and out time is finite time)(0280-0281 ). 

the sub-playitem (Fig.7) associated with the playitem (Fig. 7)and providing 
navigation information (EP_Map; Fig. 67;0347-O350) for reproducing audio data from a 
second stream file (audio stream; 0349-0350), and the mark information including at 
least one mark pointing to the still picture (160;01 88-01 92X0298)(Fig. 83). 

Kato does not expressly disclose displaying of atleast one still picture an infinite 
period of time until user input is received. 
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Spence discloses displaying of atleast one still picture an infinite period of time 
until user input is received (001 1 , 0096). 

At the time of invention, it would have been obvious to a person of ordinary skill 
in the art to combine Kato with the teachings of Spence. Motivation to combine would 
have been so that images are displayed until a user has provided an input. So that a 
user has a greater control of a slide show. 

As to claim 29, Kato discloses an apparatus for reproducing a data structure for 
managing reproduction of at least one still image recorded on a recording medium, 
comprising: 

a pick up configured to reproduce data recorded on the computer readable medium(Fig. 
1, readout unit, 28); 

a controller configured to control the pickup (readout unit, 28) to reproduce at least one 
playlist file from the recording medium, (Fig. 1 , controller 23), 
the playlist file including at least one playitem (Fig. 7X0154), at least one sub-playitem 
(Fig. 7) and mark information (0160)(0188-0190), 

the playitem providing navigation information (EP J/lap; Fig 67; 0347-0350) for 
reproducing presentation data (Video data and ancillary data, 0170) from a first stream 
file (AV stream file), the presentation data including the at least one still picture (video 
data includes picture information; Fig. 39) and associated data (Ancillary data or clip 
information file;0170) and not including audio data (Clip can be video or audio, Fig. 71; 
0352-0353), the presentation data (video data) being divided into still picture units (play 
items) in the first stream file such that each still picture unit (Fig. 83) includes a still 
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picture and associated data (video and ancillary data or clip information file; 0170; Figs. 
2 and 67), the playitem Including duration information (in_time and outjime, Fig. 32; 
02.80-0281) indicating whether to display the at least one still picture for one of a finite 
period (in and out time is finite time)(0280-0281 ). 

the sub-playitem (Fig.7) associated with the playitem (Fig. 7)and providing 
navigation information (£P_Map;. Fig. 67;0347-0350) for reproducing audio data from a 
second stream file (audio stream; 0349-0350), and the mark information including at. 
least one mark pointing to the still picture (160;01 88-01 92)(0298XFig. 83). 

Kato does not expressly disclose displaying of atleast one still picture an infinite 
period of time until user input is received. 

Spence discloses displaying of atleast one still picture an infinite period of time 
until user input is received (001 1, 0096). 

At the time of invention, it would have been obvious to a person of ordinary skill 
in the art to combine Kato with the teachings of Spence. Motivation to combine would 
have been so that images are displayed until a user has provided an input. So that a 
user has a greater control of a slide show. 

As to claims 31, 38, 45, 52, Kato further discloses wherein the at least one mark 
includes a time stamp indicating a time address of the still picture in the first stream file, 
and the at least one mark includes a playitem indicator indicating the playitem with 
which the at least one mark is associated (01 89-01 90XFig. 44)(0294). 

As to claims 33, 40, 47, 54, Kato further discloses wherein the at least one mark 
includes a time stamp indicating a time address of the still picture in the first stream file 



Application/Control Number 10/759,461 Page 9 

Art Unit: 2621 

(0189;0299), and the at least one mark includes a playitem indicator indicating the 
playitem with which the at least one mark is associated (Fig. 44,01 89;0190;0294). 

As to claims 34, 41 , 48, 55, Kato further discloses wherein the at least one mark 
includes a time stamp indicating a time address of the still picture in the first stream file 
(0189:0299), and the at least one mark includes a playitem indicator indicating the 
playitem with which the at least one mark is associated (Fig. 44; 01 89-01 90;0294). 

As to claims 35, 42, 49, 56,Kato further discloses wherein the mark information 
includes a number of marks, and the mark information includes a number indicator 
indicating the number of marks, and for each mark, the mark information provides a 
type indicator indicating a type of the at least one mark (Figs. 43-44; 0189-0190; 
0294,0298). 

Claim Rejections - 35 USC § 103 

4. The following is a quotation of 35 U.S.C. 1 03(a) which forms the basis for all 

obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the Invention was made. 

5. Claims 5, 32, 39, 46 and 53 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over U.S. Patent Pub. 2002/0164152 A1 to Kato et al. ("Kato") in 
view of U.S. Patent Pub. 2002/0130896 A1 to Spence et al. "Spence" 

and in further view of U.S. Patent Pub. 2005/01 63463 A1 to Schick et al. 
("Schick"). 
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As to claims 5, 32, 39, 46, and 53, Kato and Spence as modified discloses 
everything claimed as applied in claims 1,26, 27, 28 and 29 above. In addition Kato 
discloses wherein the at least one mark includes a mark type indicator indicating that 
the at least one mark is of a type that provides a point to skip to (Fig. 43). Kato and 
Spence as modified do not expressly disclose when displaying a slideshow of still 
pictures. 

Schick discloses displaying a slideshow of still pictures (See figs. 4, 7,1 6 and 

0143). 

At the time of invention, it would have been obvious to a person of ordinary skill 
in the art to combine Kato and Spence as modified with Schick. Motivation would have 
been to provide a skipping function having a "skip increment" in a slide show to skip 
between multiple images. 

6. Claims 36, 43, 50 and 57 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over U.S. Patent Pub. 2002/0164152 A1 to Kato et al. ("Kato") in 
view of U.S. Patent Pub. 2002/0130696 A1 to Spence et al. "Spence" and in view of 
U.S. Patent 6,122.436 to Okada et ai. "Okada" and in further view of U.S. Patent 
6,856,756 B1 to Mochizuki et al. "Mochizuki" 

As to claims 36, 43, 50, 57, Kato and Spence as modified disclose everything . 
claimed as applied in claims 26, 27, 28 and 29 above. Kato and Spence as modified do 
not expressly disclose wherein the associated data to be graphic data and/or subtitle 
data. 



Application/Control Number: 10/759,461 Pagel 
Art Unit 2621 

Okada discloses wherein the related data to be subtitle data (Subtitles, lines 34- 

49). 

At the time of invention, it would have been obvious to a person of ordinary skill 
in the art to combine Kato and Spence as modified with the teachings of Okada. 
Motivation to combine would have been to provide data in subtitles so that additional 
data could be provided to a viewer. 

Kato, Spence and Okada as modified do not expressly disclose related data to 
be graphic data. 

Mochizuki discloses related data to be graphic data (Col. 5, lines 15-47). 

At the time of invention, it would have been obvious to a person of ordinary skill 
In the art to combine Kato, Spence, and Okada as modified with the teachings of 
Mochizuki. Motivation to combine would have been to provide data in graphics so that 
more types of data could be provided to a viewer through subtitles. 

Conclusion 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to ASHER KHAN whose telephone number is (571 )270- 
5203. The examiner can normally be reached on 9:00 AM to 5:00 PM. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Marsha Banks-Harold can be reached on (571)272-7905. The fax phone 
number for the organization where this application or proceeding is assigned is 571- 
273-8300. 
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Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-21 7-91 97 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571^272-1000. 

/Marsha D. Banks-Harold/ 

Supervisory Patent Examiner, Art Unit 2621 



/A. K./ 

Examiner, Art Unit 2621 
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